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Abstract: The impact of event colour structure on the performance of the JOHNS- 
HOPKINS, CMS, HEPTOPTAGGER and N-SUBJETTINESS algorithms is investigated by study- 
ing colour singlet and colour octet resonances decaying to top-quark pairs. Large differences 
in top-tagging efficiency are observed due to the different colour charge of each resonance. 
These differences are quantified as a function of the algorithm parameters, the jet size 
parameter and the probability to misidentify light quarks and gluons as top candidates. 
We suggest that future experimental searches would benefit from optimising the choice of 
algorithm parameters in order to minimise this source of model dependency. 
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1 Introduction 

The benefit of using boosted object reconstruction in searches for heavy resonances has 
only been fully appreciated in recent years [1-6]. At the Large Hadron Collider (LHC), 
electroweak-scale resonances can be produced with transverse momentum, prp, much larger 
than the particle's mass, m, due to the large centre-of-mass energy of the colliding protons. 
The decay products of an object produced with ^> m tend to be collimated in the lab 
frame and confined to a small area of the detector. This reduces combinatorial problems in 
reconstructing the object in question, by collecting all radiation from the decay products 
into one 'fat' jet. Jet substructure techniques, which study the energy distribution of 
the constituents inside the fat jet, are then necessary to distinguish between those jets 
originating from boosted objects and the QCD-induced backgrounds from light quark and 
gluon jets. 

The reconstruction of highly boosted top quarks is a particularly well documented 
physics case in which jet substructure techniques are indispensable if a New Physics signal 
is to be extracted from the large QCD background [7, 8]. Several boosted top quark 
reconstruction algorithms have been proposed in the literature (for an overview see [9]). 
These so-called 'top-taggers' make use of two classes of observable. The first class are 
well established event shape observables that have been elevated to jet shape observables 
[10-13]. The most prominent example of a jet shape is the jet mass, upon which all top- 
tagging algorithms rely to some extent in order to discriminate the top signal from light 
quark and gluon backgrounds. The second class of observable is the clustering history 
of the jet constituents. Infrared-safe sequential jet algorithms introduce a metric that 
determines the ordering for jet constituents to be combined into subjets and, eventually, 
into the fat jet. As the radiation profile of quarks and gluons is diff'erent in comparison to 
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the profile produced by the decay of a boosted top quark, tracing backward through the 
recombination history and comparing the energy sharing between the subjets allows us to 
identify signal and background jets [14-17]. 

These top-tagging algorithms, and boosted algorithms in general, only use information 
related to the fat-jet constituents and implicitly assume that the overall hadronic event 
activity is of no major importance. However, radiation from the initial-state, the hard 
interaction and the underlying event can enter the fat-jet. The efficiency for tagging boosted 
objects will therefore depend upon the underlying colour structure in the event and, in 
particular, the colour of the decaying resonance, a fact that has been largely ignored in the 
literature thus far. 

In this article, we study the effect of resonance colour on the efficiency of well es- 
tablished top-tagging algorithms. We focus on s-channel resonances with different colour 
charge, i.e. a colour-singlet Kaluza-Klein (KK) photon, ^j^j^-, and a colour-octet KK gluon, 
Gj^j^. The two resonances produce very different radiation patterns. The colour octet can 
itself radiate gluons and the top quarks are colour connected with the initial state, whereas 
the initial and final states are not inter-connected for a colour singlet resonance. The re- 
sults we will present have direct relevance for ongoing searches for heavy colour singlet 
and colour octet resonances at the LHC [18, 19]. Finally, although we focus on resonances 
that decay into tt pairs, the results we present should be qualitatively similar for boosted 
decays into other electroweak-scale resonances. 

The article is arranged as follows: In Sec. 2 we introduce the jet reconstruction and 
top-tagging algorithms. In Sec 3, we present the Monte Carlo event generator used to study 
top-tagging efficiencies. In Sec. 4 present the performance of the top-tagging algorithms 
for the different signal models. In Sec. 4.3, we discuss the experimental difficulties in 
establishing top-tagging performance in light of these new results. 

2 Boosted top reconstruction 

The JOHNS-HOPKINS top-tagger [14] is a descendant of the two-body approach of [3]. In the 
first step, the energy sharing of subjet splittings is evaluated whilst browsing backwards 
through the recombination history. If a jet splitting (j — )• jij2) produces two subjets that 
satisfy min(prp j^,prj, 2)/Px j ^ (mass-drop) and ARj-^j^ > 6r, the subjets are retained 
as candidates for a top-quark decay. The default parameters are chosen to be 5p = 0.12 
and 5r = 0.16. If a mass drop is observed, each subjet is immediately tested for another 
mass drop. The fat-jet is considered as a top candidate if three or four subjets are kept 
after the double-mass-drop procedure. Kinematic cuts are then applied to these subjets; 
the summed subjet mass should be in a top-mass window, a subjet pair should reconstruct 
the VF-boson mass within a given window and the helicity angle^ for this pair should be 
below a given threshold {cosOh < 0.7). 

The CMS top-tagger is a modification to the JOHNS-HOPKINS algorithm [16]. In the 
first step, the angular separation of the subjets must pass a pT-dependent criterion, i.e. 

^The helicity angle is defined as the angle, measured in the rest frame of the reconstructed W, between 
the direction of the reconstructed top and one of the W decay products. 
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Ai?jjj2 > 6r — A ■ ptJ) with default values of 6r = 0.4 and A = 0.0004. At least three 
subjets have to be present after passing the double-mass-drop procedure and, among the 
three hardest subjets, the two subjets with the smallest invariant mass are required to 
be consistent with the W^-boson mass. The top candidate is defined as the three or four 
hardest subjets. 

The HEPTOPTAGGER was proposed for the reconstruction of mildly boosted top quarks 
in a busy environment [17]. The first step aims to find the complete hard substructure of 
the fat jet, irrespective of the expected number of top decay products, and does not stop 
after a certain number of mass-drop conditions are met. It browses through all branches 
of the jet recombination history comparing at every step the relative mass sharing of the 
jij2 — )• j mergings. The condition min(ji,j2) < 0.8 • rrij determines if ji and j2 are kept 
as candidates for a heavy object decay. Subjets with mj < 30 GeV are not unclustered 
further and are stored as potential top decay products. In the second step, all three-subjet 
combinations are filtered, similar to [3], and the combination that is closest to the top mass 
within a specific top mass window is considered to be the top candidate. The subjets of 
the top candidate are required to pass three criteria: 
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The default values of the dimensionless mass window bounds are set to i?min = 
and i?max = 1.15 • m^/m^. 

The N-SUBJETTINESS top-tagger [10] is an adaption of the event shape observable N- 
jettiness [20]. The observable aims to quantify the degree to which jet radiation is aligned 
along specific subjet axes. Smaller values of tn correspond to N or fewer energy deposits 
inside the fat jet, while large values of tm indicate more than energy deposits. After 
using the exclusive fcx algorithm [21] to determine the candidate subjet directions, tm is 
calculated by 



TN 



(2.2) 



The summation is over all jet constituents, pr^ ^ is the transverse momentum of each jet 
constituent, Ai?^ ^ is the distance in rj x (p space between the subjet axis A and each jet 
consituent, and Rq is the jet radius. The ratio tn/tn-i provides a good separation of top 
jets from light quark and gluon jets. The parameters used to define top candidates are 
tn/tn~i and the fat-jet mass. 
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3 Monte Carlo event simulation 



To study the effect of colour flow on the top-tagging performance, we simulate the produc- 
tion of heavy gluons and heavy photons in proton-proton collisions at ^/s = 14 TeV. The 
production of heavy gluons decaying to top-quark pairs {qq — )• Gkk — ^ ti) is simulated 
using the pythia 8 event generator, with the specific implementation documented in [22]. 
The gluon resonance couplings are chosen to be gl = 0.2, 5^ = and (7* = 3.6, and the 
resonance mass is chosen to be 2 TeV. This results in a resonance width of T /M = 0.2 
and a cross section x branching-ratio of 1.1 pb at ^/s = 14 TeV. Heavy photon production 
is simulated by changing the colour factors, — >• e^aem, where is the quark electric 
charge. The couplings of the heavy photon to light quarks and top quarks are then adjusted 
to reproduce heavy gluon production rate and resonance width, respectively. 

The events are generated with the CTEQ5L parton distribution functions [23]. Par- 
ton showering, hadronisation and underlying event from multiple parton interactions are 
generated with the default tune to pythia 8.130 (Tune 1) [24]. For final states contain- 
ing top-quarks, it is possible to simulate events in pythia 8 with two different parton 
shower options: 'wimpy showers' allow radiation up to the factorisation scale of the pro- 
cess, whereas 'power' showers allow radiation up to the kinematic limit. We found that 
the former option gave a much better description for the jet activity in Standard Model 
(SM) ti events when compared to the ATLAS data [25] and 'wimpy' showers are therefore 
used throughout this study. The parton shower comparison to the data is documented in 
Figure 7 of Appendix A. 

The probability that an algorithm wrongly identifies jets originating from light quarks 
and gluons as top-candidates is also assessed, to allow comparisons between different top- 
tagging algorithms for a fixed mis-tag probability. This light quark and gluon background 
is also simulated using pythia 8 with the default underlying event tune. 

4 Performance of top-tagging algorithms 
4.1 Basic kinematic features of the signal events 

The Cambridge-Aachen (CA) algorithm [26] as implemented in Fastjet [27] is used to re- 
construct jets using all stable final state interacting particles^ with |r/| < 4.9, which is 
the coverage of the ATLAS and CMS calorimeters. The default CA distance parameter is 
R = 0.8, unless otherwise stated, and full four momentum recombination is used. Jets are 
initially kept for further analysis if they have > 350 GeV. Figure 1 shows the trans- 
verse momentum of the leading and subleading jets. Following a dijet mass requirement, 
1.6 < Mjj < 2.4 TeV, the transverse momentum of the leading/subleading jets from heavy 
gluon and heavy photon resonance decays are very similar. Any differences in top-tagging 
efficiency will therefore not be a consequence of different kinematic distributions. 

^i.e. excluding neutrinos 
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Figure 1. Transverse momentum of the leading (a) and subleading (b) top-jet candidates as 
reconstructed using the Cambridge- Aachen algorithm with R = 0.8. The KK-gluon, KK-photon 
and SM tt events are shown in blue, red and green, respectively. Two distributions are shown for 
each sample, with no specific cut on the dijet invariant mass (solid lines) and after a dijet invariant 
mass cut of 1.6 < Mjj < 2.4 TeV (dashed lines). 



4.2 Dependence of top-tagging efficiency on event colour structure 

We define tfie single and double tagging efficiencies, ei and e2, as 

where A'^2j is the number of events that contain two CA jets with > 500 GeV and 
\r]\ < 2.5. A^it is the subset of these dijet events for which the leading jet is identified as 
a top-candidate using one of the algorithms presented in Sec. 2. N2t is the subset of the 
dijet events for which the leading and subleading jets are identified as a top-candidates. 

The top-tagging efficiency depends on an educated guess as to the jet substructure 
that resembles a top quark decay. Figure 2 shows the relative difference between the 
double tagging efficiency for KK gluon and KK photons as a function of two parameters 
for each algorithm. The algorithm parameters are discussed in Sec. 2. It is immediately 
apparent that there is a difference between the double tagging efficiencies for colour octet 
and colour singlet resonances. This difference varies between 0% and 75%, depending on 
the algorithm and the parameter choice. We find the largest efficiency differences occur 
when a high background rejection rate is targeted, i.e. for small mass windows and/or tight 
parameter cuts. This efficiency difference is directly due to the different colour structure 
in the events: There is more radiation present in events containing heavy photons, as 
observed in previous work [22, 28]. This difference in quark/gluon activity in each type of 
event means that there is more chance of radiation into the jets and hence more chance 
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HEPTopTagger N-subjettiness 




Figure 2. EfRciency difference for double top-tagging efficiencies for the (a) JOHNS-HOPKINS, (b) 
CMS, (c) HEPTOPTAGGER and (d) N-SUBJETTINESS algorithms. The difference of the efficiencies is 
presented as a function of the cuts applied to the algorithm input parameters. For example, mtop±^ 
refers to the size of the window for which the top mass reconstructed from jet constituents is required 
to satisfy. Details pertaining to the top mass reconstruction and other algorithm parameters are 
given in Sec. 2. 

of failing various sub jet requirements. For each algorithm, parameters can be chosen that 
yield the same average tagging efficiency for a color octet and a color singlet resonance. 
However, these parameters tend to correspond to very loose cuts on the subjets and would 
have poor background rejection. 
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Signal tagging efficiencies are usually quantified as a function of the mis-tag probability 
for the light quark and gluon background, so that the performance of each algorithm can be 
compared at similar working points. To do this, the two-dimensional tagging efficiencies 
for each signal and the background are projected onto one dimensional slices, allowing 
efficiency/mis-tag curves to be constructed. The choice of projection is somewhat arbitrary 
and so we choose three projections for each algorithm. For the johns-hopkins and CMS 
algorithms, efficiency /mis-tag curves are constructed by varying (i) only the top mass 
window (corresponding to a horizontal slice at mw i 30 GeV), (ii) only the W mass 
window (corresponding to a slice at rrit =b 50 GeV), or (iii) both the top and the W mass 
windows simulataneously such that the W mass window is approximately 80% of the size 
of the top mass winsow (corresponding to a slice across the diagonal of Figure 2 (a) ) . For 
the N-SUBJETTINESS and HEPTOPTAGGER algorithms, we vary the T3/T2 cut and m^^/m^ 
window, rather than the W mass window. All other parameters of the algorithms are fixed 
to the default values. 

Figure 3 shows the double top-tagging efficiency for heavy gluons and heavy photons as 
a function of the mis-tag probability for each top-tagging algorithm. The efficiency versus 
mis-tag curves for each signal are highly dependent on the choice of projection. As the 
signal efficiency at a given mis-tag working point is dependent on the choice of parameters, 
the algorithm parameters need to be optimised in any given search. We do not discuss 
the absolute values of the signal efficiencies further in this article, but instead focus on 
the tagging efficiency differences for KK gluons and KK photons. In this respect, we find 
that the JOHNS-HOPKINS tagger does not show a strong dependence on the parameter 
varied, producing similar large efficiency differences if the mis-tag probability is required 
to be small. This is not true for the other taggers. For the n-subjettiness and CMS 
taggers, the biggest tagging differences are found by varying the top mass window only. 
Conversely, the heptoptagger is almost completely insensitive to the variation of the top 
mass window, being more dependent on the mass plane window variation, which can result 
in a 15% double tagging efficiency difference between KK gluons and KK photons. This 
implies that searches could be designed to minimise the model-dependency for any given 
mis-tag working point by choosing algorithm parameters carefully. It is worth noting that, 
in searches for heavy resonances, working points corresponding to low mis-tag probabilities 
(< 1%) are favoured to overcome the large QCD backgrounds. However, this is exactly 
the region that yields the largest tagging efficiency differences between KK gluons and KK 
photons, typically ranging from 15% to 50%. 

The difference in tagging efficiencies is also present when tagging just the leading jet 
in the event, as shown in Figure 4. This demonstrates that the different colour structure is 
affecting both jets, not just the subleading jet. This is particularly important for searches 
in a semi-leptonic channel, in which only one jet is tagged as a top jet. It also implies 
that a tight/loose cut combination on the leading/subleading jets would not reduce the 
observed model dependency. 

The size of the fat jet is a crucial parameter for any subjet analysis. In Figure 5, we 
examine the difference in the double tagging efficiency between heavy gluons and heavy 
photons as a function of the CA distance parameter, R. We show the tagging efficiency 
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Figure 3. Efficiency of correctly tagging the leading and subleading jets from the decay of a 
gluon resonance (blue) and photon resonance (red), using the (a) johns-hopkins, (b) CMS, (c) 
HEPTOPTAGGER and (d) N-SUBJETTINESS algorithms. The efficiency is presented as a function 
of the double mis-tag probability, which is the probability of misidentifying two light jets as top- 
candidates. The efficiency and mis-tag probabilities are estimated by varying the cuts on the 
algorithm parameters. The ratio plots show the relative signal efficiencies (e|'"°"/e2^°*°" — 1) as a 
function of the mis-tag probability for each algorithm and parameter variation. 
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Figure 4. EfBciency of correctly tagging the leading jet from the decay of a gluon resonance 
(blue) and photon resonance (red), using the (a) johns-hopkins, (b) CMS, (c) heptoptagger 
and (d) n-SUBJETTINESS algorithms. The efficiency is presented as a function of the single mis-tag 
probability, which is the probability of misidentifying the leading jet in a light-quark or gluon sample 
as the top-candidate. The efficiency and mis-tag probabilities are estimated by varying the cuts on 
the algorithm parameters. The ratio plots show the relative signal efficiencies (|ggi"°"^gPhotoii _ j^-j 
as a function of the CA distance parameter for each top-tagging algorithm. 
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Figure 5. Efficiency of correctly tagging the leading and subleading jets from the decay of a 
gluon resonance (blue) and photon resonance (red), using the (a) Johns-Hopkins, (b) CMS, (c) 
HEPTOPTAGGER and (d) N-subjettiness algorithms. The efficiency is presented as a function of 
the CA distance parameter used to reconstruct the jets. The efficiency and mis-tag probabilities 
are estimated by varying the cuts on the algorithm parameters. The ratio plots show the relative 
signal efficiencies (e|'"°'^/e2'^°'°" — 1) as a function of the mis-tag probability for each algorithm and 
parameter variation. 
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at each value of R for three efficiency working points defined by the background mis-tag 
probabiUties (0.01%, 0.1%, 1%). The algorithm parameters chosen to obtain those working 
points are presented in Table 1. In general, the efficiency increases slightly as R decreases. 
This is somewhat counter intuitive because a larger jet is more likely to capture all top 
decay products. However, in this study, the top quarks are typically boosted enough 
for their decay products to be captured by a jet size of i? ~ 0.8 and increasing the jet 
size simply allows more soft radiation to enter the jet. Finally, although the top-tagging 
efficiency only slightly decreases for larger R, the difference in tagging efficiencies between 
the KK gluon and KK photon increases rapidly. Optimising the jet size as a function of the 
top quark transverse momentum has already been suggested in the literature [29]. These 
results imply that the minimal possible value of R should be chosen, in order to minimise 
the dependence on the colour structure. 

4.3 Impact on experimental analyses 

The dependence of the top-tagging efficiency on the event colour structure raises a number 
of experimental issues relating to the use of top-tagging algorithms in the search for, and 
possible discovery of, new phenomena. 

In searches for heavy particles decaying to ti, the sensitivity to the resonance's colour 
charge poses an experimental challenge in assessing the systematic uncertainties of the 
search. The relevant issues are clarified by scrutinising the recent CMS search for resonances 
decaying to top quark pairs [18]. The single top-tagging efficiency for a colour singlet 
resonance was estimated to be 50% for a mis-tag probability of 5%, using the CMS algorithm 
(R = 0.8, fat-jet p-p > 500 GeV). The systematic uncertainty in the tagging efficiency was 
estimated to be 3% (6%) for a single (double) tag, by comparing the I^-tagging efficiency 
in data to MC using a sample of moderately boosted SM events. The same methodology 
also yielded a scale (normalisation) factor of 0.97 for MC events, to recover the central 
tagging efficiencies observed in the data. These scale factors and systematic uncertainties 
were then used to normalise all signal and background samples. 

It is not immediately obvious that universal scale factors and systematic uncertain- 
ties can be used, given the colour flow dependency discussed in the previous section. The 
problem is that the data/MC agreements are observed in a control region, but the tagging 
performance can change dramatically between this control region and the signal regions. 
In the case of the CMS analysis, the efficiency scale factors and systematic uncertainties 
are determined from PF-tagging in a sample of tt events and then assumed to be directly 
applicable to (i) top-tagging in the background MC samples and (ii) top-tagging in the 
signal MC samples. Both of these assumptions directly impact on the observed limit (al- 
though the dominant effect on the limit is usually from the background estimation). We 
note that the application of the scale factors and uncertainties to the background tt sample 
is not unreasonable because the colour flow in the control and background samples will be 
similar. However, the colour flow is very different between the control and signal regions, 
resulting in different tagging efficiencies as shown in Figure 6; the difference between the 
double tagging efficiency for the SM ti events and the KK photon (gluon) is approximately 
20% (35%) at the nominal mis-tag working point of 0.25% (= 5%^). Extrapolating from 
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Figure 6. Double-tag efficiency and efficiency difference for the CMS tagger and the HEPTOPTAG- 
GER. We compare the KK-photon and the KK-gluon with the SM tt background. 



the control region to the signal region therefore requires much confidence in the MC gen- 
erators. Finally, we note that the dependency on MC could be reduced in future analyses 
by optimising the choice of tagging algorithm and/or tagging parameters. For example, 
Figure 6 also shows that the heptoptagger has much smaller differences between control 
and signal regions. 

The situation would become even more problematic if a heavy resonance was observed 
at the LHC. Without prior knowledge of the resonance colour, the cross-section x branching- 
ratio could not be extracted in a model independent way. It is worth noting that methods 
to extract the colour of a heavy resonance have been discussed in the literature [22, 28, 30- 
32]. Using one or more of these methods would be mandatory to reduce uncertainties 
related to the tagging efficiency. 

The dependence of the tagging efficiency on the colour flow will impact upon other 
searches if boosted tagging algorithms are used for signal identification. We briefly highlight 
two other search channels of current interest. The first case is the search for scalar top 
(stop) pair production in which the (light) stops are produced in a boosted topology [33]. 
In the worst case scenario, one or more of the stops are produced via a decay chain (e.g 
from gluino decays). In this case, the colour flow is very different to SM tt production and 
the determination of the signal tagging efficiency and associated systematic uncertainty 
will have to fully rely on the MC simulation. The second case is the search for a very 
broad resonance (i.e. with TeV width) decaying to top-quark pairs. In this scenario, 
distinguishing between the signal and background events is not possible and the background 
control regions will be contaminated by the signal. In this case, the MC will have to be fully 
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relied upon to untangle the relative efficiency differences between signal and background. 
5 Summary and outlook 

The impact of event colour structure on the performance of standard top-tagging algorithms 
was investigated by studying Monte Carlo simulations of colour singlet and colour octet 
resonances decaying to top-quark pairs. Large differences in top-tagging efficiency were 
observed due to the different colour charge of each resonance. Attempting to reduce the mis- 
tag probability, by tightening the algorithm cuts, leads to the largest differences in the top- 
tagging efficiency for colour singlet and colour octet resonances. The difference in tagging 
efficiency was also found to be largest for large values of the distance merging parameter 
(i?) used to reconstruct the CA jet. The algorithm with the least overall sensitivity was 
the HEPTOPTAGGER, which is the only algorithm with a built in jet grooming procedure to 
remove contamination from soft-radiation into the jet. Optimising the algorithm and choice 
of parameters would minimise this model dependency in future experimental searches. 

The model dependency directly impacts on the searches that use boosted top-tagging 
algorithms. Furthermore, in the advent of a discovery, the model dependence will prevent a 
precise extraction of the cross section x branching ratio for the newly observed resonance. 
The problem, experimentally, is related to the lack of a suitable control region to determine 
the tagging efficiency for the signal events and the measurements may have to rely on the 
simulation provided by the MC event generators. 

The observation that the tagging efficiency depends strongly on the events colour flow 
has wider relevance than the case presented in this article (searching for narrow resonances 
decaying to top quark pairs). For example, the search for the pair production of light scalar 
tops using boosted reconstruction techniques would also suffer from the lack of identifiable 
control regions for signal. Furthermore, the search for a broad resonance decaying to 
top-quark pairs would have a background control region contaminated by the signal and 
untangling the efficiency differences would be entirely dependent on the MC models. 

Acknowledgments 

The authors would like to thank Sarah Livermore and Mark Owen for interesing discussions. 
This work was funded in the UK by STFC. 

Appendix A 

For tt final states produced using the pythia 8 event generator, it is possible to choose two 
parton showering options; 'wimpy' showers allow QCD radiation up to the factorisation 
scale, whereas power showers allow radiation up to the kinematic limit. We compared 
the results of using the two parton shower options to the ATLAS data [25] in Fig. 7 and 
concluded that the wimpy showers give a better description of the data. Wimpy showers 
were therefore used for the MC event generation in this study. 



- 13 - 




Figure 7. The fraction of ti events that do not contain an additional jet with px > Qo- The 
ATLAS data [25] are compared to the predictions from pythia 8 using the so-caUed 'power' (red) 
and 'wimpy' (blue) parton shower options. The data clearly favour the wimpy shower, which was 
subsequently used for all the MC event generation in this study. 
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